RNA-dependent RNA Polymerases



Telomerases 



ms DNAs 



Sc-Est2p 
Ea-p123 
Sp-Trt1p
hTRT 



Mito. plasmid/RTL 



Group II Introns 



Non-LTR Retrotransposons 



Hepadnaviruses 



LTR Retrotransposons (Copia-Ty1 Subgroup) 
LTR Retrotransposons (Gypsy-Ty3 Subgroup)



Caulimoviruses



Retroviruses 



FIG. 6 






lambda Gphi5
44022bp 



NCO1 19329
NCO1 19932
SPLA 2202
SNAB1 2220 



ISPLA 22021 
ISNABI 2220 




HPA1 4874 

AVR2 5699 
AGE1 5915 
ECO47III 6078

MLU1 7020 

SPH1 7746 
SAL1 7980
BAMH1 7992
A = pGRN144
B = pSEAP2-Basic
C = BGL2-ECO47III fragment from A
into BGL2-NRU1 sites of B

FIG. 10B
181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CS1
GGGRQTYYQC 
NFkB-MHC-1.2 
TGGGCTTCCCC 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intron1

301 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CS1
GGGRQTYYQC 
NFkB_CS2 
RGGGKMTYYCC 

Topo_II_cleavage_site
RNYKWCNNGYNGKTNYNY

361 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 



FIG. 12 
1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 

251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT

351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 

451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 

501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 

551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 

601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 

651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 

701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 

751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 

801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 

851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 

901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 

951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 

1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 

1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 

1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 

1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG

1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG

1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 

1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 

1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 

1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 

2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC

2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 

2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 

2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 

2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 

2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 

2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA

FIG. 13
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC

2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 

2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 

2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 

2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 

2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC

2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG

3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA

3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 

3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 

3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 

3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 

FIG. 13 

(CONTINUED) 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN

251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 

351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 

401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLVVS

451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 

501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 

551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 

601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 

651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 

701 KNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 

751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 

801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 

851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 

901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 

951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



FIG. 14
MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP
AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRL 
CERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALR
GSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLY
QLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPG 
ARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRG 
PSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP
SLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLEL 
LGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEE 
EDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNE 
RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGC
VPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNR
LFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPAL 
LTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPP 
ELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVI
EQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHA
KTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPA
HGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGR 
NMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRF 
HACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSL 
GAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQ 
TQLSRKLPGTTLTALEAAANPALPSDFKTILD 

FIG. 17 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAA
AAAAAAAAAAAAAAAA 
MetSerValTyrValValGluLeuLeuArgSerPhePhe
TyrValThrGluThrThrPheGlnLysAsnArgLeuPhe
PheTyrArgLysSerValTrpSerLysLeuGlnSerIle
GlyIleArgGlnHisLeuLysArgValGlnLeuArgGlu
LeuSerGluAlaGluValArgGlnHisArgGluAlaArg
ProAlaLeuLeuThrSerArgLeuArgPheIleProLys
ProAspGlyLeuArgProIleValAsnMetAspTyrVal 
ValGlyAlaArgThrPheArgArgGluLysArgAlaGlu 
ArgLeuThrSerArgValLysAlaLeuPheSerValLeu 
AsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAla 
SerValLeuGlyLeuAspAspIleHisArgAlaTrpArg 
ThrPheValLeuArgValArgAlaGlnAspProProPro 
GluLeuTyrPheValLysValAspValThrGlyAlaTyr 
AspThrIleProGlnAspArgLeuThrGluValIleAla
SerIleIleLysProGlnAsnThrTyrCysValArgArg
TyrAlaValValGlnLysAlaAlaHisGlyHisValArg 
LysAlaPheLysSerHisValLeuArgProValProGly
AspProAlaGlyLeuHisProLeuHisAlaAlaLeuGln
ProValLeuArgArgHisGlyGluGlnAlaValCysGly
AspSerAlaGlyArgAlaAlaProAlaPheGlyGly 

FIG. 19

1 

met 

. GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 



pro arg ala pro arg cys 
CCG CGC GCT CCC CGC TGC 

20 

his tyr arg glu val leu 
CAC TAC CGC GAG GTG CTG



gly pro gln gly trp arg
GGG CCC CAG GGC TGG CGG 

50 

phe arg ala leu val ala 
TTC CGC GCG CTG GTG GCC 



ala arg pro pro pro ala 
GCA CGG CCG CCC CCC GCC

80 

leu lys glu leu val ala 
CTG AAG GAG CTG GTG GCC 



gly ala lys asn val leu 
GGC GCG AAG AAC GTG CTG 

110 

ala arg gly gly pro pro 
GCC CGC GGG GGC CCC CCC 



arg ala val 
CGA GCC GTG 



pro leu ala 
CCG CTG GCC 

40 

leu val gln
CTG GTG CAG 



gln cys leu
CAG TGC CTG 

70 

ala pro ser 
GCC CCC TCC 



arg val leu 
CGA GTG CTG 

100 

ala phe gly 
GCC TTC GGC 



glu ala phe 
GAG GCC TTC 



arg ser leu 
CGC TCC CTG 



thr phe val 
ACG TTC GTG 



arg gly asp 
CGC GGG GAC 



val cys val 
GTG TGC GTG 



phe arg gln
TTC CGC CAG 



gln arg leu
CAG AGG CTG 



phe ala leu 
TTC GCG CTG 



thr thr ser 
ACC ACC AGC 



leu arg ser 
CTG CGC AGC 

30 

arg arg leu 
CGG CGC CTG 



pro ala ala 
CCG GCG GCT 

60 

pro trp asp 
CCC TGG GAC 



val ser cys 
GTG TCC TGC 

90 

cys glu arg 
TGC GAG CGC 



leu asp gly 
CTG GAC GGG 

120 
val arg ser 
GTG CGC AGC 
tyr leu pro asn thr val thr
TAC CTG CCC AAC ACG GTG ACC

140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gln val cys gly pro
GCC TAC CAG GTG TGC GGG CCG 



thr gln ala arg pro pro pro
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 



val pro leu gly leu pro ala
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 

290 

gly ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 



gln his his ala gly pro pro

CAG CAC CAC GCG GGC CCC CCA 

320

trp asp thr pro cys pro pro 

TGG GAC ACG CCT TGT CCC CCG 



130 

asp ala leu arg gly ser gly ala 
GAC GCA CTG CGG GGG AGC GGG GCG 

150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gln leu gly ala ala
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gln gly ser trp
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 



280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 

300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAC GCC GAG ACC AAG CAC TTC 
340

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu

CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA

350 360 

leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 

CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 

GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 

arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro

CGG AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC

400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly

CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG

410 420 

val leu leu lys thr his cys pro leu arg ala ala val thr pro 

GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA

430 

ala ala gly val cys ala arg glu lys pro gln gly ser val ala

GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG

440 450 

ala pro glu glu glu asp thr asp pro arg arg leu val gln leu

GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG

460 

leu arg gln his ser ser pro trp gln val tyr gly phe val arg

CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG

470 480 

ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 

GCG TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 

leu gly lys his ala lys leu ser leu gln glu leu thr trp lys

CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 

ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 



530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG
ala
GCC 


lys 
AAG 


phe 
TTC 


leu 
CTG 


his 
CAC 


trp 
TGG 


leu 
CTG 


leu 
CTC 


arg 
AGG 


ser 
TCT 


560 
phe 
TTC 


phe 
TTT 


tyr
TAT 


val 
GTC 


arg 
AGG 

ile 
ATT 


leu 
CTC 

gly 
GGA 


phe 
TTT 

ile 
ATC 


phe 
TTC 
590 
arg 
AGA 


tyr 
TAC 

gln
CAG 


arg 
CGG 

his 
CAC 


lys 
AAG 

leu 
TTG 


ser 
TCG 


glu 
GAA 


ala 
GCA 


glu 
GAG 


val 
GTC 


arg 
AGG 


gln
CAG 


leu 
CTG 


thr 
ACG 


ser 
TCC 


620 
arg 
AGA 


leu 
CTC 


arg 
CGC 


phe 
TTC 


pro 
CCG 


ile 
ATT 


val 
GTG 


asn 
AAC 


met 
ATG 


asp 
GAC 


tyr 
TAC 


arg 
AGA 


glu 
GAA 


lys 
AAG 


650 
arg 
AGG 


ala 
GCC 


glu 
GAG 


arg 
CGT 


phe 
TTC 


ser 
AGC 


val 
GTG 


leu 
CTC 


asn 
AAC 


tyr 
TAC 


glu 
GAG 


gly 
GGC 


ala 
GCC 


ser 
TCT 


680 
val 
GTG 


leu 
CTG 


gly 
GGC 


leu 
CTG 


thr 
ACC 


phe 
TTC 


val 
GTG 


leu 
CTG 


arg 
CGT 


val 
GTG 


arg 
CGG 


tyr 
TAC 


phe 
TTT 


val 
GTC 


710 
lys
AAG 


val 
GTG 


asp 
GAT 


val 
GTG 


gln
CAG 


asp 
GAC 


arg 
AGG 


leu 
CTC 


thr 
ACG 


glu 
GAG 


val 
GTC 


asn 
AAC 


thr 
ACG 


tyr 
TAC 


740 
cys 
TGC 


val 
GTG 


arg 
CGT 


arg 
CGG 



550 

met ser val tyr val val glu leu 
ATG AGT GTG TAC GTC GTC GAG CTG 

570 

thr glu thr thr phe gln lys asn
ACG GAG ACC ACG TTT CAA AAG AAC 

580 

ser val trp ser lys leu gln ser
AGT GTC TGG AGC AAG TTG CAA AGC 

600 

lys arg val gln leu arg glu leu
AAG AGG GTG CAG CTG CGG GAG CTG 

610 

his arg glu ala arg pro ala leu 
CAT CGG GAA GCC AGG CCC GCC CTG 

630 

ile pro lys pro asp gly leu arg 
ATC CCC AAG CCT GAC GGG CTG CGG 

640 

val val gly ala arg thr phe arg 
GTC GTG GGA GCC AGA ACG TTC CGC 

660 

leu thr ser arg val lys ala leu 
CTC ACC TCG AGG GTG AAG GCA CTG

670 

arg ala arg arg pro gly leu leu 
CGG GCG CGG CGC CCC GGC CTC CTG 

690 

asp asp ile his arg ala trp arg 
GAC GAT ATC CAC AGG GCC TGG CGC 

700 

ala gln asp pro pro pro glu leu
GCC CAG GAC CCG CCG CCT GAG CTG 

720 

thr gly ala tyr asp thr ile pro 
ACG GGC GCG TAC GAC ACC ATC CCC 

730 

ile ala ser ile ile lys pro gln
ATC GCC AGC ATC ATC AAA CCC CAG 

750 

tyr ala val val gln lys ala ala
TAT GCC GTG GTC CAG AAG GCC GCC
his gly his val
CAT GGG CAC GTC 

770 

val pro gly asp 
GTG CCA GGG GAT 



gln pro val leu
CAG CCT GTG CTA 

800 

ser ala gly arg 
TCG GCG GGA CGG 

GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
760

arg lys ala phe lys ser his val leu arg pro 
CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

780 

pro ala gly leu his pro leu his ala ala leu 
CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790

arg arg his gly glu gln ala val cys gly asp
CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

807 

ala ala pro ala phe gly gly OP 
GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 




1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

61 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG

CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



301 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG

AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 

CCAAT 

661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT

CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 
CAP

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTC
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 

CAP 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG

AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

961 aaccagtgtaagctacaacttaacttttgttggaacaaattttccaaaccgcccctttgc 
ttggtcacattcgatgttgaattgaaaacaaccttgtttaaaaggtttggcggggaaacg

1021 CCTAGTGGCAGAGACAATTCACAAACacagccctttaaaaaggcttagggatcactaagg
ggatcaccgtctctgttaagtgtttgtgtcgggaaatttttccgaatccctagtgattcc 

1081 ggatttctagaagagcgacccgtaatccttaagtatttacaagacgaggctaacctccag 
cctaaagatcttctcgctgggcattaggaattcataaatgttctgctccgattggaggtc 

1141 cgagcgtgacagcccagggagggtgcgaggcctgttcaaatgctaagcttccataaataa 
gctcgcactgtcgggtccctcccacgctccggacaagtttacgattcgaaggtatttatt 

1201 agcaaatttcctccggcagtttctggaaagtaggaaaggttaacatttaaggttgcgttt

tcgtttaaaggaggccgtcaaagacctttcatcctttccaattgtaaattccaacgcaaa 

1261 gttagcatttcagtgtttgccgacctcagctaacagcatccctgcaaggcctcgggagac
caatcgtaaagtcacaaacggctggagtcgattgtcgtagggacgttccggagccctctg 

1321 ccagaagtttctcgccccttagatccaaacttgagcaacccggagtctggattcctggga 
ggtcttcaaagagcggggaatctaggtttgaactcgttgggcctcagacctaaggaccct 

TopoII 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 
1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC

1801 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Sp1

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Sp1



E2F 
******** 

2281 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 

Sp1



E2F NFkB 

********* ^i^^^^if^^^ V^r ****************** * 

2401 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRT 5'

*****************************>. 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 



FIG. 21

(CONTINUED) 






E2F 
*******

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



2701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_II_cleavag



NFkB 
++++++++++ 
NFkB 

Intron1

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e_site 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 

3001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

3301 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 
3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGGGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 
CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT
GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 



4081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 



4201 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



Intron2 



4321 



****

GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 
gccaagttcctgcactggctgatgagtgtgtacgtcgtcgagctgctcaggtctttcttt
tatgtcacggagaccacgtttcaaaagaacaggctctttttctaccggaagagtgtctgg 
agcaagttgcaaagcattggaatcagacagcacttgaagagggtgcagctgcgggacgtg 
tcggaagcagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactc 
cgcttcatccccaagcctgacgggctgcggccgattgtgaacatggactacgtcgtggga 
gccagaacgttccgcagagaaaagagggccgagcgtctcacctcgagggtgaaggcactg 
ttcagcgtgctcaactacgagcgggcgcg 
TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCC 
AGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 
AGGGGCAAGTC 

FIG. 24 



PANEL A




elution with 
displacement 
oligonucleotide 



PANEL B 




FIG. 26 






pBB5212 pGRN133 



Approximate Cell No. 





5,000 5,000 5,000 5,000 
Internal Control
Telomerase:
TELOMERASE



TEMPLATE 

CCAAAACCCCAAAAC 
TTTTGGGG-3'



TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC 
GGGGTTTT-3' 




PANEL A 



PANEL B 
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1 CCCCAAAACC CCAAAACCCC A?^AACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 

251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 

451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 

551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 

651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 

751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 

851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 

901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 

951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 

1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 

1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 

1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 

1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 

1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 

1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 

1751 GGGGTTTTGG GG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE- 

c PKPQNPKTPIKKEKIEVV*K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 
61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGY*FG*YRKF- 

c *NIIPAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA " 

121 + + + + + — + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHS TSIAALVVTRKDAK 

b TS*YIQQV*QLL*CQERMQN- 

C LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b IEIWLEIAFIDYSKVAKTIR- 

c LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*E N*F* 

b VLLLGCKSL*RFFLEKISFK- 

C FYFSDANLYNDSFLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRN*NITNV*IKSGN- 

C SGE QRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 
361 + + + + + + 420 

. ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLR SIMEKIT* 

b EDYSIF*ITS *GALWRKLLN- 

c RI ILFFRSLLKEHYGENYLI-' 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*K VNSLDYFPS .QQ* *VY*I 

b TKR*TVWIISLANNDEYIK. F- 

C LKGKQFGLFP*PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQR QTRY 

b I*E*VKGSRYIRLTKDKLAI-- 

C YENESKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG . 

541 + + + : + --^+ + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQ.EKV**SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR. - 

C NARKSLIIEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 . — + + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES*DN 

b MGFI .TIVLGIDGELPSLETI- 

C WVLLQLF*VSTVNSRVLRQ L- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a *KSCLQLKESQF*KFCCVCH 

b EKAVYN* RNRSSESSDVYAI- 

C KKLFTTEGIAVLKVLMCMPL^- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMD S Yr'nK 

b IL*INLKYLISI*WIAIETN- 

C - FCELI SNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + : + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNK P CKFNGIYVKS FGT NAH 

b QINHASLMEYT LNPLGQMHT- 

c K*TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a *IY IGFLKHRYTECFRD*FS 

b EFILDS*SIDTQNALETDLA- 

C NLYWILKA*IHRML*RLI*. L- 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQI 'TCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL*KKQ- 

C TTDYLF*. LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

c RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 

b ELTTRILATKKKKSYHNPDS- 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS- 

c KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSF ILTIFFD *LE 

b FS FHSCYFLLS*QYFLISWK- 

C FHFTAVIF FYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260- 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR* TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

c KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

c IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 
1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT • 

a *CYED*IFRV KKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

C VL*GLNF*SQEMEPKS*SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILID*RD*RGNCTEDH*RNK 
b S*LIEEIDEATAQKIIKEIK- 
c LD*LKRLTRQLHRRSLKK*S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + - + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

c NFY*LENKLNY*YRDQRSSI- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + -+ + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KLN*S*TIKNTNLGQ N 

b *RNKS*TK VRQ*KIQTLVKI .- 

C DEI KAELKLDNKK YKPWSK Y- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ*IK* 

b LRKEK KTS* QKKK*GNK*NE- 

C *GRKRRPVSKRK NKAINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFIFFNNLLKRGV 

b YRSEEIKDLFFS IIY* KEGF- 

c TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + — 1762 

AACCCCAAAACCCCAAAACCCC 

aLGFWGF G- 
b WGFGVL G 
c GVLGFW 
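
The listing above shows each 60-base row translated in three forward reading frames, labelled a, b and c, with `*` marking stop codons. A minimal Python sketch of that translation follows. It assumes the standard genetic code, which is what the printed frames appear to use (TGA is shown as `*`); the names `CODON_TABLE`, `translate` and `three_frames` are illustrative and not taken from the figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str, frame: int = 0) -> str:
    """Translate complete codons of seq starting at offset frame (0, 1 or 2).
    Codons containing anything other than A, C, G, T become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(frame, len(seq) - 2, 3)
    )


def three_frames(seq: str) -> dict[str, str]:
    """Return the translations labelled a, b, c as in the figure."""
    return {label: translate(seq, offset) for label, offset in zip("abc", range(3))}


if __name__ == "__main__":
    # First row of the listing (positions 1 to 60).
    row = "CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA"
    for label, protein in three_frames(row).items():
        print(label, protein)
```

Frames b and c come out one residue shorter than in the figure when a single row is translated in isolation, because the figure completes the last codon with bases from the following row.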
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCWHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKJIQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINSIELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYIKIFGMSDSILKFISAKQGGANM 
VEVIKNFALQKIGQK 

FIG. 43 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVWQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAIPVSATNAVENLNVLLKKVKHANLNLVSIP 
TQFNFDFYFVlSIIiQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNXNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 45 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYTVIQFNGQFFTQIVGNRCNEPHLPPKWVQRSSSSSATAAQIK 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
IFPTNLVKIPQRLKVRINLTLQKLLKRHKRLNYVSILNSICPPL 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKKSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNVVNASRVPKPYELYIDNVRTVHLSNQDVINVVEMEIFKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISE 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
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LCVSY ILS S FYYANLEENALQFLRKE SMDPEKPETNLLMRLT 
LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 



FIG. 47 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTAGCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIG. 48 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREK 
RAERLTSRVKALFSVLNYERA 



FIG. 49 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 



FIG. 50 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 

DSKPDEGVQFSSPKCSQSELIAJSrVVKQMFDESFERRRNLLMKGFSMNHEDFRAMH^ 

PNYLISILESKNWQLLLEIIGSDAlfflYLLSKGSIFEALPNDNyLQISGIPLFK]^ 

TIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTWPKJILLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKI 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 

VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 

PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITN^ 

QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 

FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDWPFEKWQLLSMKTSDT^ 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESK^ 

FSVl^SLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 

lYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLWIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 



FIG. 51 






S-1: FFY VTE TTF QKN RLF FYR KSV WSK 

S-2: RQH LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 



A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV 

FIG. 54 
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ta a g c c teg 
5 ' - cag acc aaa gga att cca taa gg -3 ' 
QTKGIPQG 



4(B') 



5(c' ) 



D D y L L I T 
ctg ctg atg gag gag tag tgg -5' 
a a aaaaa a a 
t t t t 
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Poly 1 
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Vector 



Genomic DNA Insert 
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5825 bp Sequenced 
2 kb Hind III Fragment 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1. Transform with a linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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heterozygote diploid 
wt diploid 
wt haploid 



Tetrad 



tez1A 




Generations 
after sporulation 



Telomeric 
Apa I 
fragment 




600 bp 
500 bp 
400 bp 

300 bp 
200 bp 

100 bp 
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1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG - ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 
leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gln his leu lys arg val gln leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 
glu ala glu val arg gln his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 
ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 
glu lys arg ala glu arg leu thr ser arg val lys ala leu phe 
GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 
ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 
ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 
phe val leu arg val arg ala gln asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 
phe val lys val asp val thr gly ala tyr asp thr ile pro gln 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 
asp arg leu thr glu val ile ala ser ile ile lys pro gln asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 
thr tyr cys val arg arg tyr ala val val gln lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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210 



gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 
gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 
leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 260 
arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 
thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 
arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 
val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 
phe val gln met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 
leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 
tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 
phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 
arg leu lys cys his ser leu phe leu asp leu gln val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 

390 
leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 
ala tyr arg phe his ala cys val leu gln leu pro phe his gln 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 
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420 



gln val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 
arg leu pro leu leu leu his pro glu ser gln glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 
val ala gly gly gln gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 
arg ala val ala val pro pro ser ile pro ala gln ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 
thr pro cys his leu arg ala thr pro gly val thr gln asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 
pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 
pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 
his pro gly leu met ala thr arg pro gln pro gly arg glu gln 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 
thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 
arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 



GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 



TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep p123     .LWSLIRCFFYVTEQQKSYSKT. 
Sp Tez1     .FIIPILQSFFYITESSDLPNRT. 
Sc Est2     .LIPKIIQTFFYCTEISSTVTIV. 
Hs TCP1     .YVVELLRSFFYVTETTFQKNRL. 
consensus   FFY TE 



Motif 0 
            K 
            p hhh K hR h R 
Ep p123     .KSLGFAPGKLRLIPKKT--TFRPIMTFNKKIV. 
Sp Tez1     .QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL. 
Sc Est2     .TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD. 
Hs TCP1     .ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG-. 
consensus   R PK R I 



Motif A 
            AF 
            h hDh GY h 
Ep p123     .PKLFFATMDIEKCYDSVNREKLSTFLK. 
Sp Tez1     .RKKYFVRIDIKSCYDRIKQDLMFRIVK. 
Sc Est2     .PELYFMKFDVKSCYDSIPRMECMRILK. 
Hs TCP1     .PELYFVKVDVTGAYDTIPQDRLTEVIA. 
consensus   F D YD 



.//. 



Motif B 
            hPQG pS hh 
Ep p123     .NGKFYKQTKGIPQGLCVSSILSSFYYA. 
Sp Tez1     .GNSQYLQKVGIPQGSILSSFLCHFYME. 
Sc Est2     .EDKCYIREDGLFQGSSLSAPIVDLVYD. 
Hs TCP1     .RATSYVQCQGIPQGSILSTLLCSLCYG. 
consensus   G QG S 



Motif C 
            Y 
            h F DD hhh 
Ep p123     .PNVNLLMRLTDDYLLITTQENN.. 
Sp Tez1     .KKGSVLLRVVDDFLFITVNKKD.. 
Sc Est2     .SQDTLILKLADDFLIISTDQQQ.. 
Hs TCP1     .RRDGLLLRLVDDFLLVTPHLTH.. 
consensus   DD L 



Motif D 
            Gh h cK 
Ep p123     .NVSRENGFKFNMKKL. 
Sp Tez1     .LNLSLRGFEKHNFST. 
Sc Est2     .KKLAMGGFQKYNAKA. 
Hs TCP1     .LRTLVRGVPEYGCVV. 
consensus   G 
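
The consensus row under each motif keeps only the residues that are identical in all four telomerase sequences. A minimal Python sketch of that column-by-column comparison is given below, using the Motif A rows as input. The function name `consensus` and the use of a space for non-conserved columns are illustrative assumptions, and the sketch covers only the strict-identity consensus, not the `h`, `p` and `c` class symbols shown above the alignments.

```python
def consensus(aligned: list[str], gap: str = " ") -> str:
    """Return a string with the residue at each column where every
    aligned sequence agrees, and `gap` elsewhere."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return "".join(
        column[0] if len(set(column)) == 1 else gap
        for column in zip(*aligned)
    )


# Motif A rows (flanking dots removed), in figure order:
# Ep p123, Sp Tez1, Sc Est2, Hs TCP1.
MOTIF_A = [
    "PKLFFATMDIEKCYDSVNREKLSTFLK",
    "RKKYFVRIDIKSCYDRIKQDLMFRIVK",
    "PELYFMKFDVKSCYDSIPRMECMRILK",
    "PELYFVKVDVTGAYDTIPQDRLTEVIA",
]

if __name__ == "__main__":
    print(repr(consensus(MOTIF_A)))
```

For Motif A the conserved columns are F, D and YD, which agrees with the consensus row printed in the figure.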
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 

451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 

501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 

551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 

601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 

651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 

701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 

751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 

801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 

851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 

901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 

951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATGGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGGC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 

1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 

1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

1651 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 

1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 

1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 

1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 

1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 

2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 

2051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 

2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG

2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 

2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 

2251 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 

2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC

2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 

2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 

2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG

2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 

2801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 

2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 

2901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 

2951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 

3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 

3051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 

3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 

3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 

3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC
3251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 

3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG
3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
3451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC

1 + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGSPGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

C SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT

61 + + + + + + 120

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRA LPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL - 

C ALPAA EPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG

121 + + + + + + 180

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLAT FVRRLGPQGWRLVQRG- 

C RWPRSCGAW GPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR?GGP??GVRALG?

b DPAAFRA?VA?C?VCVPW??- 

c T RRLSARWWP?AWCA CPG??- 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG

a ?AAPRRPLLPPGVLPE??GG 
b ??PPA APSFRQVSCL??LVA- 

c G?PPPPPPSARCPA*??WWP-

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC

301 + + + + + + 360

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG

a PSAA?AVR?RRE?RAGLRLR 

b RVL ??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQR AQL 

b LLDGARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAG ERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

c CPTR*PTHCGGAGRGGCCCA- 
CCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGNTNTTTGTGCTGGT

481 + + + + + + 540

GGCGCACCCGCTGCTGCACGACCAAGTGGACGACCGTGCGACGCGCNANAAACACGACCA

a PRGRRRAGSPAGTLR??CAG

b RVGDDVLVHLLARCA?FVLV-

c AWATTCWFTCWHAAR?LCWW-

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC

541 + + + + + + 600

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG

a GSQLRLP?VRAAAVPARRC?

b ?PSCAY?VCGPPLYQLGAAT-

c ?PAAP T?CAGRRCTSSA L?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT

601 + + + + + + 660

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA

a SGP APATR?WTR?RLGSNGP 

b QARPPPHA?GPE?VWDPTGL - 

C RPGPRHTL?DP?A SGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

a GTIASGRPGSPWAASPGCEE 

b EP*RQGGRGPPGLPAPGARR- 

C NHS VREAG VPLGCQPRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

a ARGQCQPKSAV AQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

c AGAVPA EVCRCPRGPGVALP- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

a P*AGADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP-

C LSRSGRPLGRGPGPTRAGR L- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG

a WTE*PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPIRG PPA 

b SLEGALSGTRHSHPSVGRQH- 

c LWRVRSLARATPTHPWAAST-

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + + + + + + 1020

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
a PR GPPIHIAATTSWDTPCPP

b H A GPPSTSRPPRPGTRLVPR- 

c TRAPHPHRGHHVLGHALSPG- 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC

1021 + + + + + + 1080

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGGAG

a VYAETKHFLYSSGDK?TA?L

b CTPRPSTSSTPQAT?TLRPS-

c VRRDQALPLLLRRQ?HC?PP-

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA

1081 + + + + + + 1140

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT

a LPTQYI*GPA*LAFGRFVET

b FLLNISEAQPDWRSGGSWR?-

c SYSIYLRPSLTGVREVRGD?-

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA

1141 + + + + + + 1200

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT

a ?FLVPGLGCQDSPQVAPPAP

b SFWFQALDARIPRRLPRLPQ-
c LSGSRPWMPGFPAGCPACPS-

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC

1201 + + + + + + 1260

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG

a A?LANAAPVSGAAWEPRAVP

b RYWQMRPLFLELLGNHAQCP- 

c ?TGKCGPCFWSCLG TTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG

1261 + + + + + + 1320

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC

a LRGVPQDALPAASCGHPSSR 

b YGVFLKTH CPLR AAVTPAA G- 

C TGCSS RRTARCELRSPQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG

1321 + + + + + + 1380

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC

a CLCPGEAPGLCGGPRGGGTQ 

b VCAREKPQGSVAAPEEEEHR- 

C SVPGRSPRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT

1381 + + + + + + 1440

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA

a TPVAWCSCSASTAAPGRCTA

b PPSPGAAAPPAQQPLAGVRL-

c PRRLVQLLRQHSSPWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
a SCGPACAGWCPQASGAPGTT

b RAGLPAPAGAPRPLGLQAQR- 

C VRACLRRLVPPGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT

1501 + + + + + + 1560

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL- 

C RRFLRNT KKFISLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC

1561 + + + + + + 1620

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG

a RCRS*RGR*ACGTALGCAGA

b AAGADVEDERAGL RLAAQEP- 

C LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT

1621 + + + + + + 1680

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA

a QGLAVFRPQSTVCVRRSWPS

b RGWLCSGRRAPSA*GDPGQV- 

C GVG CVPAAEHRLREEILAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCA

1681 + + + + + + 1740

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT

a SCTG**VCTSSSCSGLSFMS

b PALADECVRRRAAQVFLLCH-

c LHWLMSVYVVELLRSFFYVT-

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT

1741 + + + + + + 1800

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA

a RRPRFKRTGSFSTGRVSGAS

b GDHVSKEQALFLPEECLEQV- 

c ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG

1801 + + + + + + 1860

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC

a CKALES DST*RGCSCGSCRK 

b AKHWN QTALEEGAAA GAVGS- 

C QSIGIRQ HLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA

1861 + + + + + + 1920

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH- 

C EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA

1921 + + + + + + 1980

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT

a SPSLTGCGRL*TWTTSWEPE

b PQA*RAAADCEHGLRRGSQN- 

C PKPDGLRPIVN MDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG

1981 + + + + + + 2040

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC

a RSAEKRGPSVSPRG*RHCSA

b VPQRKEGRASHLEGEGTVQR-

C FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG

2041 + + + + + + 2100

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

a CSTTSGRGAPASWAPL CWAW 

b AQ LRAGAAPRPPGRLCAGPG- 

C LNYERAR RPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC

2101 + + + + + + 2160

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG

a TISTGPGAPSCCVCGPRTRR 

b RYPQGLAHLRAACAGPGPAA- 

c DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA

2161 + + + + + + 2220

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT

a LSCTLSRWM*RARTTPSPRT 

b *AVLCQG GCDGRVR HHPPGQ- 

c ELYFVKVDVTGAYDTIPQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT

2221 + + + + + + 2280

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA

a GSRRSSPASSNPRTRTACVG

b AHGGHRQHHQTPEHVLRASV- 
C LTEVIASIIKPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT

2281 + + + + + + 2340

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA

a MPWSRRPPMGTSARPSRATS 

b CRGPEGRPWARPQGLQEPRL- 

C AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA

2341 + + + + + + 2400

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTNTTGT

a LP*QTSSRTCDSSWLTCR?T

b YLDRPPAVHATVRGSPAG?Q-
c TLTDLQPYMRQFVAHLQ?NS-

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG

2401 + + + + + + 2460

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
a AR*GMPSSSSRAPP*MRPAV

b PAEGCRRHRAELLPE*GQQW- 

C PLRDAVVIEQSSSLNEAS SG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT

2461 + + + + + + 2520

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

C LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC

2521 + + + + + + 2580

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG

a PTSSARGSR RAPSSPRC SAA 

b LRPVPGDPAGLHPLHAALQP- 

C YVQCQGIPQGSI LSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

a CATATWRTSCLRGFGGTGCS 

b VLRRHGEQAVCGDSAGRAAP- 

c CYGDMENKLFAGIRRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCC

2641 + + + + + + 2700

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGGAAGG

a CVWWMISCW*HLTSPTRKPS

b AFGG*FLVGDTSPHPRENLP- 

C RLVDDFL LVTPH LTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG

2701 + + + + + + 2760

AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC

a SGPWSEVSLSMAAW*TCGRQ

b QDPGPRCP*VWLRGELAEDS-

c RTLVRGVPEYGCVVNLRKTV-

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG

2761 + + + + + + 2820

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC

a W*TSL*KTRPWVARLLFRCR

b GELPCRR RGPGWHGFCSDAG- 

C VNFPVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA

2821 + + + + + + 2880

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT

a PTAYSPGAACCWIPGPWRCR

b PRPIPLVRPAAGYPDPGGAE-

c HGLFPWCGLLLDTRTLEVQS-

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA

a ATTPAMPGPPSEPVSPSTAA

b RLLQLCPDLHQSQSHLQPRL-

C DYSSYA RTSIRASLTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACA

2941 + + + + + + 3000

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT

a SRLGGTCVANS LGS CG*SVT 

b QGWEEHASQTLWGLAAEVSQ- 

c K AGRNMR RKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGA

3001 + + + + + + 3060

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT

a ACFWICR*TASRRCAPTSTR

b PVSGFAGEQPPDGVHQHLQD-

c LFLDLQVNSLQTVCTNIYKI-

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCT^ 

3061 + + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

a SSCCRRTGFTHVCCSSHFIS

b PPAAGVQVSRMCAAAPISSA- 

C LLLQ AYRFHACVLQL PFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT

3121 + + + + + + 3180

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA

a KFGRTPHFSCASSLTRPPSA

b SLEEPHIFPARHL*HGLPLL-

C VWKNPTFFLRVISDTASLCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC

3181 + + + + + + 3240

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG

a TPS*KPRTQGCRWGPRAPPA
b LHPESQERRDVAGGQGRRRP-

c SILKAKNAGMSLGAKGAAGP-

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC

3241 + + + + + + 3300

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG

a LCPPRPCSGCATKHSCSS*L

b SALRGRAVAVPPSIPAQADS-

c LPSEAVQWLCHQAFLLKLTR-

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA

3301 + + + + + + 3360

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT

a DTVSPTCHSWGHSGQPRRS*

b TPCHLRATPGVTQDSPDAAE- 
C HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC

3361 + + + + + + 3420

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG

a VGSSRGRR*LPWRPQPTRHC

b S EAPGDD ADCPGGRSQPGTA- 

C RKLPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCA

3421 + + + + + + 3480

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

c SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG

a DTSSPVTPGSTSQGGRG G PH 

b TPAALSRRA L RPREGGAA HT- 

C HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT

3541 + + + + + + 3600

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA

a PG PHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGL HV- 

C RP APLGV*GLS ECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG

3601 + + + + + + 3660

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC

a PAEG*VSG*GLSECPAKG*V

b RLKAECPAEA*ASVQPRAEC-

C G*RL SVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC

3661 + + + + + + 3720

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG

a SSTPAVFTSPQAGARLHPRA

b PAHLPSSLPHRLALGSTPGP-

c QHTCRLHFPTGWRSAPPQGQ-

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA

3721 + + + + + + 3780

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT

a SFSSPGARLPLPT*E*SIP R 

b AFPHQEPGFHSPHRNSPSPD- 

C LFLTRSPASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG

3781 + + + + + + 3840

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW - 

c RHCSPLALPSFAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG

3841 + + + + + + 3900

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC

a ETLRRTLGALGIWSDQRCAL

b RP*EGPWELWEFGVTKGVPC-

c DPEKDPGSSGNLE*PKVCPV-

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGT

3901 + + + + + + 3960

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA

a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGG PCGSNWGEV - 

C HRR GPCTWMGVPVGQIGGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA

3961 + + + + + + 4020

CGACACCCTCATTTTATGACTTATATACTCAAAAAGTCAAAACTTTTTTTTTTTTTTTTT

a AVGVKY*IYEFFSFEKKKKK

b LWE*NTEYMSFSVLKKKKKK-

c CGSKILNI*VFQF*KKKKKK-

AAAAAAAAA 
4021 4029 



a K K K 

b K K 

c K K 
1

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

40 

gly pro gln gly trp arg leu val gln arg gly asp pro ala ala
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

70 

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 



110 120

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG



140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gln val cys gly pro
GCC TAC CAG GTG TGC GGG CCG



thr gln ala arg pro pro pro
ACT CAG GCC CGG CCC CCG CCA 



150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gln leu gly ala ala
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 
200

leu gly cys glu arg 
CTG GGA TGC GAA CGG 



val pro leu gly leu 
GTC CCC CTG GGC CTG 

230 

ser ala ser arg ser 
AGT GCC AGC CGA AGT 



ala ala pro glu pro
GCT GCC CCT GAG CCG 

260 

ala his pro gly arg 
GCC CAC CCG GGC AGG 



val val ser pro ala 
GTG GTG TCA CCT GCC 
290 

gly ala leu ser gly 
GGT GCG CTC TCT GGC 



gln his his ala gly
CAG CAC CAC GCG GGC 

320 

trp asp thr pro cys 
TGG GAC ACG CCT TGT 



leu tyr ser ser gly 
CTC TAC TCC TCA GGC 

350 

leu ser ser leu arg
CTC AGC TCT CTG AGG 



glu thr ile phe leu 
GAG ACC ATC TTT CTG 

380 

arg arg leu pro arg 
CGC AGG TTG CCC CGC



leu phe leu glu leu 
CTG TTT CTG GAG CTG 

410 

val leu leu lys thr 
GTG CTC CTC AAG ACG 
ala trp asn his ser
GCC TGG AAC CAT AGC 

220 

pro ala pro gly ala 
CCA GCC CCG GGT GCG 



leu pro leu pro lys 
CTG CCG TTG CCC AAG 

250 

glu arg thr pro val 
GAG CGG ACG CCC GTT 



thr arg gly pro ser 
ACG CGT GGA CCG AGT 

280 

arg pro ala glu glu 
AGA CCC GCC GAA GAA 

thr arg his ser his 
ACG CGC CAC TCC CAC 

310 

pro pro ser thr ser 
CCC CCA TCC ACA TCG 



pro pro val tyr ala 
CCC CCG GTG TAC GCC 

340 

asp lys glu gln leu
GAC AAG GAG CAG CTG 



pro ser leu thr gly 
CCC AGC CTG ACT GGC 

370 

gly ser arg pro trp 
GGT TCC AGG CCC TGG 



leu pro gln arg tyr
CTG CCC CAG CGC TAC 

400 

leu gly asn his ala 
CTT GGG AAC CAC GCG 



his cys pro leu arg 
CAC TGC CCG CTG CGA 
210

val arg glu ala gly 
GTC AGG GAG GCC GGG



arg arg arg gly gly 
AGG AGG CGC GGG GGC 

240 

arg pro arg arg gly 
AGG CCC AGG CGT GGC 



gly gln gly ser trp
GGG CAG GGG TCC TGG 

270 

asp arg gly phe cys 
GAC CGT GGT TTC TGT 



ala thr ser leu glu 
GCC ACC TCT TTG GAG 

300

pro ser val gly arg 
CCA TCC GTG GGC CGC 



arg pro pro arg pro 
CGG CCA CCA CGT CCC 

330 

glu thr lys his phe 
GAG ACC AAG CAC TTC 



arg pro ser phe leu 
CGG CCC TCC TTC CTA 

360 

ala arg arg leu val 
GCT CGG AGG CTC GTG 



met pro gly thr pro 
ATG CCA GGG ACT CCC 

390 

trp gln met arg pro
TGG CAA ATG CGG CCC 



gln cys pro tyr gly
CAG TGC CCC TAC GGG 

420 

ala ala val thr pro 
GCT GCG GTC ACC CCA 



ala 
GCA 


ala 
GCC 


gly 
GGT 


val 
GTC 


cys 
TGT 


ala 
GCC 


pro 
CCC 


glu 
GAG 


440
glu 
GAG 


glu 
GAG 


leu 
CTC 


arg 
CGC 


gln
CAG 


his 
CAC 


ser 
AGC 


ala 
GCC 


cys 
TGC 


leu 
CTG 


470 
arg 
CGC 


arg 
CGG 


his 
CAC 


asn 
AAC 


glu 
GAA 


arg 
CGC 


arg 
CGC 


leu 
CTG 


gly 

GGG 


lys 
AAG 


500
his 
CAT 


ala 
GCC 


met 

ATG


ser 
AGC 


val 
GTG 


arg 
CGG 


asp 
GAC 


gly cys 

GGC TGT


val 
GTT 


530 
pro 
CCG 


ala 
GCC 


ala 
GCC 


lys 
AAG 


phe 
TTC 


leu 
CTG 


his 
CAC 


leu 
CTC 


arg 
AGG 


ser 
TCT 


560 
phe 
TTC 


phe 
TTT 


arg 
AGG 


leu 
CTC 


phe 
TTT 


phe 
TTC 


tyr 
TAC 


ile 
ATT 


gly 
GGA 


ile 
ATC 


590 
arg 
AGA 


gln
CAG 


ser 
TCG 


glu 
GAA 


ala 
GCA 


glu 
GAG 


val 
GTC 


leu 
CTG 


thr 
ACG 


ser 
TCC 


620 
arg 
AGA 


leu 
CTC 


pro 
CCG 


ile 
ATT 


val 
GTG 


asn 
AAC 


met 
ATG 
430



ala 
GCC 


arg 
CGG 


glu 
GAG 


lys 
AAG 


pro 
CCC 


asp 
GAC 


thr 
ACA 


asp 
GAC 


pro 
CCC 


arg 
CGT 


ser 
AGC 


pro 
CCC 


trp 
TGG 


460 

gln val
CAG GTG 


leu 
CTG 


val 
GTG 


pro 
CCC 


pro 
CCA 


gly 
GGC 


phe 
TTC 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


thr 
ACC 


lys 
AAG 


leu 
CTC 


ser 
TCG 


leu 
CTG 


gln
CAG 


cys 
TGC 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


arg 
CGC 


ala 
GCA 


glu 
GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


trp 
TGG 


leu 
CTG 


met 
ATG 


550 
ser 


val 

GTG 


tyr 
TAT 


val 
GTC 


thr 
ACG 


glu 
GAG 


thr 
ACC 


arg 
CGG 


pro 
CCG 


ser 
AGT 


580 
val 
GTC 


trp 
TGG 


his
CAC 

arg 
AGG 


leu 
TTG 

gln
CAG 


lys 
AAG 

his 
CAT 


arg 
AGG 
610 
arg 
CGG 


val 
GTG 

glu 
GAA 


arg 
CGC 


phe 
TTC 


ile 
ATC 


pro 
CCC 


lys 
AAG 


asp 
GAC 


tyr 
TAC 


val 
GTC 


640 
val 
GTG 


gly 
GGA 
gln gly ser val ala
CAG GGC TCT GTG GCG 

450 

arg leu val gln leu
CGC CTG GTG CAG CTG 



tyr gly phe val arg 
TAC GGC TTC GTG CGG 

480 

leu trp gly ser arg 
CTC TGG GGC TCC AGG 



lys lys phe ile ser 
AAG AAG TTC ATC TCC 

510 

glu leu thr trp lys 
GAG CTG ACG TGG AAG 



arg ser pro gly val 
AGG AGC CCA GGG GTT 

540 

arg glu glu ile leu 
CGT GAG GAG ATC CTG 



tyr val val glu leu 
TAC GTC GTC GAG CTG 

570 

thr phe gln lys asn
ACG TTT CAA AAG AAC 



ser lys leu gln ser
AGC AAG TTG CAA AGC 

600 

gln leu arg glu leu
CAG CTG CGG GAG CTG 

ala arg pro ala leu 
GCC AGG CCC GCC CTG 

630

pro asp gly leu arg 
CCT GAC GGG CTG CGG 



ala arg thr phe arg 
GCC AGA ACG TTC CGC 



arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gln pro tyr met arg gln phe val ala his leu gln
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG

790 

glu thr ser pro leu arg asp ala val val ile glu gln ser ser
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gln cys gln gly ile pro gln gly ser ile leu ser thr leu leu
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
880

leu val thr pro his leu thr his ala lys thr phe leu arg thr
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 



910 



lys 


thr 


val 


val 


asn 


phe 


pro 


val 


glu 


asp 


glu 


ala 


leu 


gly 


gly 


AAG 


ACA 


GTG 


GTG 


AAC 


TTC 


CCT 


GTA 


GAA 


GAC 


GAG 


GCC 


CTG 


GGT 


GGC 








920 




















930 




thr 


ala 


phe 


val 


gln


met 


pro 


ala 


his 


gly 


leu 


phe pro 


trp 


cys 


ACG 


GCT 


TTT 


GTT 


CAG 


ATG 


CCG 


GCC 


CAC 


GGC 


CTA 


TTC 


CCC 


TGG 


TGC 


















940 














gly 


leu 


leu 


leu 


asp 


thr 


arg 


thr 


leu 


glu 


val 


gln


ser 


asp 


tyr 


GGC 


CTG


CTG


CTG 


GAT 


ACC 


CGG 


ACC 


CTG 


GAG 


GTG 


CAG 


AGC 


GAC 


TAC 








950 




















960 




ser 


ser 


tyr 


ala 


arg 


thr 


ser 


ile 


arg 


ala 


ser 


val 


thr 


phe 


asn 


TCC 


AGC 


TAT 


GCC 


CGG 


ACC 


TCC 


ATC 


AGA 


GCC 


AGT


GTC 


ACC 


TTC 


AAC 


















970 














arg 


gly 


phe 


lys


ala 


gly 


arg 


asn 


met 


arg 


arg 


lys 


leu 


phe 


gly 


CGC 


GGC 


TTC 


AAG 


GCT 



GGG 


AGG 


AAC 


ATG 


CGT 


CGC 


AAA 


CTC 


TTT 


GGG 








980 




















990 




val 


leu 


arg 


leu 


lys 


cys 


his 


ser 


leu phe 


leu 


asp 


leu 


gln


val 


GTC 


TTG 


CGG 


CTG 


AAG 


TGT 


CAC 


AGC 


CTG 


TTT 


CTG 


GAT 


TTG 


CAG 


GTG 


















1000 












asn 


ser 


leu 


gln


thr 


val 


cys 


thr 


asn 


ile 


tyr 


lys 


ile 


leu 


leu 


AAC


AGC


CTC


CAG 


ACG 


GTG 


TGC 


ACC


AAC 


ATC 


TAC


AAG 


ATC 


CTC 


CTG 








1010 


















1020 


leu 


gln


ala 


tyr 


arg 


phe 


his 


ala 


cys 


val 


leu 


gln


leu 


pro 


phe 


CTG


CAG


GCG




TAC 


AGG 


TTT 


CAC


GCA


TGT 


GTG 


CTG


CAG 


CTC 


CCA 


TTT 


















1030 












his 


gln


gln


val 


trp 


lys 


asn 


pro 


thr phe 


phe 


leu 


arg 


val 


ile 


CAT 


CAG 


CAA 


GTT 


TGG 


AAG 


AAC 


CCC 


ACA 


TTT 


TTC 


CTG 


CGC 


GTC 


ATC 








1040 


















1050 


ser 


asp 


thr 


ala 


ser 


leu 


cys 


tyr 


ser 


ile 


leu 


lys 


ala 


lys 


asn 


TCT 


GAC 


ACG 


GCC 


TCC 


CTC 


TGC 


TAC 


TCC 


ATC 


CTG 


AAA 


GCC 


AAG 


AAC 


















1060 












ala 


gly 


met 


ser 


leu 


gly 


ala 


lys 


gly ala 


ala 


gly pro 


leu 


pro 


GCA 


GGG 


ATG 


TCG 


CTG 


GGG 


GCC 


AAG 


GGC 


GCC 


GCC 


GGC 


CCT 


CTG 


CCC 








1070 


















1080 


ser 


glu 


ala 


val 


gln


trp 


leu 


cys 


his 


gln


ala 


phe 


leu 


leu 


lys 


TCC 


GAG 


GCC 


GTG 


CAG 


TGG 


CTG 


TGC 


CAC 


CAA 


GCA 


TTC 


CTG 


CTC 


AAG 


















1090 












leu 


thr 


arg 


his 


arg 


val 


thr 


tyr 


val 


pro 


leu 


leu 


gly 


ser 


leu 


CTG 


ACT 


CGA 


CAC 


CGT 


GTC 


ACC 


TAC 


GTG 


CCA 


CTC 


CTG 


GGG 


TCA 


CTC 
1100 1110
arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC

1130 1132
phe lys thr ile leu asp OP

TTC AAG ACC ATC CTG GAC TGA TGGCCACCGGGGGACAGGCAGGGGGAGAGGAGA



CAGGAGGAGCGCTGTGACGGCGGGGTGTAGGTGGCAGGGAGGGAGGGGCGGGGGAGAGGG 
AGGCGGGCACGGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGGCGAGGCCTGCATGTGG 
GGCTGAAGGGTGAGTGTGCGGCTGAGGCGTGAGCGAGTGTCCAGCCAAGGGGTGAGTGTG 
GAGCAGAGGTGCGGTGTTGACTTGGCGAGAGGGTGGGGCTGGGGTGGACGCCAGGGCGAG 
GTTTTGYTGACCAGGAGGGGGGGTTGGAGTCCCCAGATAGGAATAGTGCATGGGGAGATT 
CGGGATTGTTCACCGYTCGGCGTGGGYTCGTTTGCCTTCCACCGCCAGCATGCAGGTGGA 
GACCGTGAGAAGGAGGCTGGGAGCTGTGGGAATTTGGAGTGAGGAAAGGTGTGCGCTGTA 
GAGAGGCGAGGACCCTGGAGCTGGATGGGGGTGCCTGTGGGTGAAATTGGGGGGAGGTGC 
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 
AAAAAAAAAA 